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Approximations of Nonlinear Sy terns Having. Outputs 
L.R. Hunt and Renjeng Su 

Abstract 

For a nonlinear system with output 

x = f (x) 
y = h (x ) , 

two types of linearizations about a point Xq in state space are con- 
sidered. One is the usual Taylor series approximation, and the other 
is defined by linearizing the appropriate Lie derivatives of the out- 
put with respect to f about x^ . The latter is called the observation 
model and appears to be quite natural for observation. It is inter- 
esting that there is a coordinate system in which these two kinds of 
linearizations agree. In this coordinate system we introduce a tech- 
nique to construct an observer. 
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Approximations Of Nonlinear Systems Having Outputs 
L.R. Hunt* and Renjeng Su** 

I. Introduction 

A much used technique for dealing with nonlinear systems is to 
linearize the system about a point or trajectory. That is, we ap- 
proximate the nonlinear system by a linear system derived using 
Taylor series expansions about the point or the trajectory. 

Given a nonlinear system with nonlinear output 

x = f (x) 

(1) Y = h(x) 

oo n o° p 

where f is a C vector field on.E and h(x) is a C Ir -valued func- 
tion, p _>1, what linear approximation about a point Xq in state 
space best reflects the local observability properties of system 

(1) ? Under certain generic conditions we construct a linear system 
of the form 

x = f (x Q ) - AXp + Ax 

(2) y = h(Xg) - CXq + Cx, 

which seems appropriate for use in state estimation. For example 
if p = 1 ( (1) is a single output system) and the gradients of h and 
the first (n-1) Lie derivatives of h with respect to f at x^ are 
linearly independent, then A and C in (2) are chosen so that 
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dh(x Q ) = C 
dLfh(Xg) = CA 

dL^h (x Q ) = CA n , 

where we think of the gradients on the left hand side as n vectors, 
k 

Here L^h(x^) denotes the kth Lie derivative of h with respect to f 
at Xq. The importance of these Lie derivatives for observability is 
stressed in [4] and [6] . 

If Xq is an equilibrium point of f, then A and C can be found 
by the usual Taylor series approach. However, in general our tech- 
nique and the Taylor series method yield different A and C matrices. 
We present an example to show this is the case. 

The A and C matrices in the linear approximation (2) of the 
nonlinear system (1) are defined by nonlinear equations. However, 
there is a orthogonal linear coordinate change to new coordinates 
in which the computations of A and C are trivial. 

Suppose we take a trajectory cp(t) of the equation x = f (x) from 
(1) and let cp.(tg) = x^ be a point on the trajectory. If we line- . 
arize system (1) about <p(t) in the usual way, take appropriate Lie 
derivatives of the output function w = y - h(tp(t)) and evaluate 
these when t = tg, we find that the equations obtained are exactly 
those used to define matrices A and C in our method. This is one 
reason why we believe that this linearizing technique will be use- 
ful in our automatic flight control research at NASA Ames Research 
Center. 

Because of noise it is obvious that the output function cannot 
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be differentiated on line. However, we hope that system (2) is 
"sufficiently close" to system (1) , that a Luenberger observer built 
using our A and C matrices will yield satisfying estimates near x^. 
However, a problem arises because the Luenberger observer requires 
choosing a gain I\ to stabilize the linear part of the difference be- 
tween the plant and the estimator system. For this stabilization, 
the usual Taylor series approach about appears to more appropri- 
ate than our observation model. Hence our competing linearizations 
both have their good and bad points. Fortunately, there is a coor- 
dinate system on H n where the Taylor series linearization and the 
observation model about x Q agree. It is in this coordinate system 
that we shall construct the observer. Practically, we cannot always 
find the inverse of the exact coordinate transformation, but we can 
build its linear part about the appropriate point. No claims are 
made about our approach being guaranteed to work in practice, but 
it does seem promising. Most problems that arise for our scheme 
also are evident in any other attempt at nonlinear observation. 

For generalized Luenberger observers and mappings of nonlinear 
systems to other systems for which observers can be constructed we 
refer to Krener and Isidori [7], Pioneering research concerning 
observability of nonlinear systems can be found in Kou, Elliott, 
and Tarn [6] and Hermann and KJfegner [4]. Other excellent references 
appearing in the literature include the recent work of Nijmeijer 
[9] , [10] . 

This technique for finding approximating linear systems for 
observability parallels our efforts [2] , [5] for establishing linear 
approximations to the nonlinear control system 
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m 

(3) x = f(x(t)) + l u . (t) g . (x (t) ) . 

i=l 1 


In future research we will add an output function to system (3) 
and ask for conditions under which the estimation technique intro- 
duced in this paper is applicable. 

For our work at NASA Ames, trajectories and controls to fly 
these trajectories are computed in an on board computer. It is im- 
possible to linearize around a trajectory in advance. However, the 
scheme developed here may prove applicable in this situation. 

In section 2 of this paper we give definitions and prelimi- 
naries and introduce our linear approximating system (2) . Section 
3 contains a technique to compute the A and C matrices of (2) and a 
result relating our approximation to the usual linearization about 
a trajectory provided that the trajectory is known. We also intro- 
duce a technique that can prove useful in constructing an’ observer. 

The paper ends with a comparison of the time responses of the out^ 

I 

puts of the nonlinear system and the observation model starting at 
the pointp x^ 

II. Definitions and Preliminaries 

co . . n 

Let a(x) be a C real-valued function on IR . . For our vector 


field 


we let 





+ • 


• + 


9a 

9x 


n 



<da,f> = 


i- - i d. . . 


the Lie derivative of a with respect to f , denoted by L f a(x) or 

L d a (x) . Since <da,f> is real-valued, we can again differentiate 

2 

with respect to f to obtain L^a(x) . This process can be continued 
by induction to yield L^a (x) , L^a (x) , etc. 

We write the R^-valued function y = h(x) in (1) as 
h = (h 1 ,h 2 , . . . ,h •) . The following assumption is made for the re- 
mainer of this paper. There exist real numbers &]_ ' $2 ' * * ' ' ^p — ® so 
that 8 -^+B 2 + * • - + 3 n = n an( 3- the set 

3,-1 


2 1 

{dh^ , dL f h^ , dL^hj, , . . . ,dL^ h^ , dh 2 , dL f h 2 , 


2 6 2~ 1 2 3 "i 

(4) dL f h 2 , . . . ,dL f h 2 , . . . ,dhp,dL f h^,dL f h^, . . . ,dL f p h^} 

is linearly independent on R n or on an open set in IR n containing 

the point x Q of interest. Since we can renumber h^,h 2 ,...,h , no 

generality is loss in supposing that 3^ .^1^2 — ’ ' ^p' w ^ ere 3p is 

3-1 

the first integer such that dL^ p h is linear combination of 

f P 

3-1 

dh ,dh. w . . . ,dh , dL^h, ,dL,-h,, , . . . ,dL^h , . . . ,dL^t h, , . . . , 

1 2 p f 1 f 2 f p f 1 


6-1 


3-1 


dL f P h 2 ' ••* ,dL f P h D— 1 ’ 


f 2 ‘ 

3 — i -1 


p -l"p-l 


is the first integer such that 


dL ^ h , is a linear combination of dh, ,dh~ , . . . ,dh , dL^h, , 
t p— 1 p r l 

3 -l B -1 3 _ 1 ~1 

dL f h 2 , . . . ,dLfhp, . . . ,dLf^~ h-^,dL^ P h2/-**^dL^ P ^p-2' etc * 

We assume these linear combinations hold on H n or on an open subset 
of B n containing x^ . 

Under the above conditions there exists a neighborhood of x^ in 
which we can distinguish any two points (see [4] and [7]). This is 
called strong local observability in [ 10 ] . I 


In system (2) we let the rows of the pxn matrix C be 
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C^,C 2 , . . . ,C , respectively. 

Definition 2.1. Given system (1) and a point in state space, 
the linear system (2) is called the observation model at x Q if A 
and C are defined by 


(5) 



dhj ( x q) i j — 1 / 2 , . . . ,p 
k. 

dL^h_. (Xq) f j 1/2, ...,p and k 



It is easy to show that if x Q is an equilibrium point of f, 
then A and C in the observation model agree with A and C computed 
by the linear parts of Taylor series expansions of f and h about x Q . 

We illustrate by an example that the observation model and 
Taylor series approach may differ for a general point x^. 

3 

Example 2.2. We take the system on R 


( 6 ) 


x. 


X, 


X 


X, 


X. 


y = 


. 3J 

h (x) 


0 

sin x 


1 ' 


» • TT 

and find its observation model at the point Xq = C^-,1,1) . Com- 
puting we have 


i 


I 


I 


c 

= dh (Xq) = 

't°'° 


CA 

= dL f h(x Q ) 


r » / 2 
2' 

( 1 

o 

(cn|cn 

CA 2 

= dL^h (X q) 

= 

n, 

- 


CA 3 

= dL 3 h(x Q ) 

= 

[- /2, 

-3/2, - 3/S] 


Thus 


(7) 



Hence the observation model about is 



x = f (x Q ) 

- Ax 0 

+Ax 

(8) 

y = h (x Q ) 

- Cx 0 

+ Cx 


with A and C as above. If we linearize f and h about we find 



X = f (x Q ) 

1 

r 

O 

+ Ax 

(9) 

y = h(x Q ) 

- CX Q 

A 

+ Cx 


with 


A 


'0 10 
0 0 1 
0 0 0 
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c = 




CIO) 


. 1 — 
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Therefore the two techniques yield different linear sytems (8) and 
(9) . 

Returning to equations (6) we compute the output and its first 
two time derivatives 


( 11 ) 


y = h (x) = sin 
y = L^h(x) = (cos x^Xj 

••2 2 

y = L f h(x) = (-sin x^) x 2 + (cos x^x^. 


2 

The inverse function theorem (dh,dL^h, and dL^h are linearly inde- 
pendent at Xq) implies that we can solve these equations for all x 
near x Q . Linearizing (11) about x Q we find 



h (x) 

/2 /2 , 

" -2 + T (x l - 

~) 

4 



L f h(x) 

/2 /2 , 

" ~2 - T <x l - 

I) 

+ ^|(x 2 -l) 

(12) 

L^h (x) 

» - /5(x 1 - J) 

- 

/2(x 2 -1) + ~(x 3 -l) 


We take our observation model with matrices A and C as in (7) 
and compute the corresponding time derivatives of the output. This 
process yields 


(13) 


y 

y 

• • 

y 


/2 /2 , it . 

~2 + T (x r 


/2 

2 


4 (x r + 


S2, 


-/2(x 1 - ~) — /2 (x 2 ~1) + — (x 3 -l) 


exactly as in (12) . 
linear part about Xq 


Therefore the observation model produces the 
of the important equations (.11) . 



Writing equations (9) from the usual Taylor series approach we 

have 


y 



Taking the last equation and its first two time derivatives gives 
us 



• • • 

Hence y and y are not good approximations to the corresponding equa- 
tions in (12) . 

Given system (1) and a point Xq there is a coordinate change on 
R n such that in the new coordinates the observation model and the 
Taylor series linearization agree. Recall that we assumed thatthe 
gradients of the functions in (4) are linearly independent. The 
following coordinate transformations are known in the literature 


(see [3] and [7] ) . 

We set T x = h lf T 2 = L f h L ,T 3 = Ljjh-^..., T ^ 

b,-i 


Bl“i 

L f h l ,T B 1 +l = h 2 


V+2" L f h 2'*‘-' T e ,+B 

V 1 

T n = h . In these T coordinates system (1) becomes 


If h„ , . . . , T oii — h , T 0 ~ — L^-h , . . . 

f 2 n-3 +1 p n-$ +2 f p' 

n ^ n ^ 


i 
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( 14 ) 


TV 

x 


= T 


i+1 ' 


i * 





y = 


[t l ,t 


e 1 +i' 


1 , 2 , 

• . ,T 


with each being a function of (T^, 

i = 1,2,..., p are functions of (x 1 ,x 2 

are functions of T, ,T„,...,T by the 

12 n 1 

rewrite system (14) as 


+ $ 2 , . . . , n 
• • • , P , 


n-$ +1 ] ~ [h l' h 2' * * * ' h p ] ' 
n 


T 2 , . . - , T n ) ’ Actually the F^ 
,...,x^) , but ,x 2 , . . • ,x n , 
inverse function theorem. We 


T i = f ' (T l' T 2 T n ), 


(15) 


Y [T 1 ,T 3 1 +1" " ,T n-B„-l ] ty l ,Y 2 f ' ' ‘ ,Y p ] 


n 


the definition of f' being obvious from (14) . If we take the Lie 
derivatives of y^ with respect to f 1 through order £L-1 (i.e. we are 
taking the time derivatives of y^ through order 3^-1) we find only 
linear equations. For a point x^ = in state space, if we start 
with system (15) , then the Taylor series linearization and the ob- 
servation model produce the same linear system. 

The major problem with using this transformation to T coordi- 
nates in practice is that is is difficult to solve and invert non- 
linear equations. In other words, the major problem is in construc- 
ting the inverse of the transf ormation and the functions 
F^,i = 1,2,..., p, as they depend on T g , T 2 • • * • , T n - However, as we 
show later, it is important that we can build the inverse of the 
linear part (about a point Xq) of the transformation and the linear 
part of F 1 ,F 2 ,...,F d about T(x^) = T Q . 

We now develop a method to solve the nonlinear equations (5) : 
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for A and C. 

III. Observation Model Calculations 
We consider system (1) and its observation model (2) where A and 
C are defined in equations (5) . The set D is defined as 


13 , 6,-1 6 2 B 2 

{dL f h 1 (x Q ),dL f h 1 (x Q ) / ,dL f h 1 (x Q ),dL f h 2 (x Q ),dL f 


V 1 


h l(x 0 ). 


6 2 -l 6-, 6-, 6o 6-5-1 

dL f h 2 (x Q ) , . . . ,dL f h x (x Q ) ,dL f h 2 (x Q ) ,dL f h 3 (x Q ) ,dL f h 1 (x Q ) , 


dL 


6 3 -! 


6 3 -l 


f h 2 (x 0 ),aL f h 3 (x Q ) , . . . ,dh 1 (x Q ) ,dh 2 (x Q ) , . . . ,dh p (x 0 ) } . 


Before forming this set checks such as 6^ = or 6, > B 2 , etc. 
should be made and no duplications should be included. For example 

$1 B 2 B 2 ~l 

if = B 2 , then D begins with dL f h 1 (x Q ),dL f h 2 (x 0 ),dL f h^fx^), 


dL 


* 2-1 


£ h 2 (x Q ),-..., provided 3 2 > 6 3 • Our ordering of this set reflects 

descending superscripts on and ascending subscripts on the h^ . 

This set D consists of the elements contained in the set described 

" ^1 ^2 ^p 

in (4) plus dL f h 1 (x Q ),dL f h 2 (x Q ) , . . . ,dL f F h p (x Q ) . 

Now we introduce an interesting (n+p)x(n+p) matrix E. The first 
row of E is the first element of D followed by p zeroes, the second 
row of E is the second element of D followed by p zeroes,..., the 
kth row of E is the kth element of D followed by p zeroes,..., the 
last row of E consists of dh (x^) and p zeroes. 

It is proved in U1 that there, are orthogonal coordinate changes 

on !R n (actually [q j] on ]R n+ P where A is orthogonal on :R n arid I is 
the identity on R 13 ) that take E to "generalized upper Hessenberg” 
form (certain adjustments need to be made first) . In fact the 
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computer codes to do this are included in [1] , and for the single 
input case (p=l) this is due to Stewart [11]. By an (n+p) x (n+p) 

generalized upper Hessenberg matrix we mean that all elements below 

, th . ■, v , _ , . . 

the p-f— subdiagonal are zero. In addition, if p > 1 and a zero 

i th th 

appears in the p— subdiagonal and i~ row, then all entries in the 
p— subdiagonal above the i— row are zero. . If (p-1) >1, and if a 
zero appears in the (p-l)~ subdiagonal and the j — row, j ^>i, then 
all entries in the (p-1) — subdiagonal above the j— row are also 

i 

zero. This can be continued if (p-1) >2, etc. Also the last p 
columns remain zero in our matrix E after the coordinate change. 

To calculate A and C as in equations (5) , we assume that we 
start in the coordinate system where the matrix E is generalized 
upper Hessenberg. Otherwise we can compute the orthogonal coordi- 
nate change to move to these desirable coordinates, calculate A and 
C, and then return to the original coordinates through the trans- 
pose (inverse) of the orthogonal change. In this form the zero in 

th th 

the p — subdiagonal is in the same row as dL^ h^tx^), in the (p-1) 

Bp-1 

subdiagonal is in the same row as dL, r h . (x,.) , etc. 

t p-i u 

We let 


a ll 

a i2 ’ 

. a. 
In 

a 21 

a 22 * 

. a~ 
2n 

_ a nl 

a n2 * 

. a 

nn 

" C ll 

C 12 * 

. c. 
In 

C 21 

C 22 * 

. c~ 
2n 

- C P1 

c _ . 

p2 

. c 

pn 


( 16 ) 



13 


Now, by the form of E (forgetting the last p columns) and the 
equations (5) 


C = 


0 


0 0 
0 0 


0 


* * 


0 * * 
0 0 * 


where * represents possible nonzero numbers (the first. * in each row 

must be nonzero by our assumptions on dh^ (x^) ,dh 2 (x^) , . . . ,dh p (x^) . 

th. 

The * in the first row of C is in the (n-p+1) — column. Then we 
know our matrix C having rows ' ^ 2 ' ' " ' ' ^ * 

From (5) 


! CpA = dL^h p (x Q ) = [0. . .0*. . .**] , . 

th 

where the first * is nonzero and in the (n-p+2) — column. This is 


[0 0 . . . 0 0 *] 


a ll a 12 


a 21 a 22 


a , a „ 
nl n2 


. a 


In 


. a 


2n 


. a 


nn 


0 ... 0 * ... * *] , 


and we find a , ,a a where a .,a -,..., a , . , . are zero, 

nl n2 nn nl n2' n (n-p+1) 

In a similar manner, C ,A = dL^h . (x„) yields 

p-1 f p-1 0 J 


a (n-l)l ,a (n-l)2' * ‘ * ' a (n-1) n 


, .. . , C^A = dL^h 1 (x Q ) yields 


a (n-p+1) l' a (n-p+1) 2 
Next, if 6 >2, 

' p ' 


/•••/' 


(n-p+1) n ‘ 


c p A 2 * (C p A)A = dL 2 h p (x 0 ) 
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allows us to find a, . , a , a, . . Continuing in this 

(n-p) 1 (n-p) 2 (n-p)n - 

k k—1 

way we can solve for all entries in A. Viewing C^A = (C^A )A = 
((dL^hj) (Xp))A in solving for A, the rows of the coefficient matrix 
are the elements in the set (4) , which we assumed are linearly in- 
dependent . 

Summarizing our technique, we use orthogonal coordinate changes 
on IR n so that solving for our A and C matrices defined in (5) be- 
comes extremely easy. 

As mentioned in the introduction, we may not know the trajectory 
of x = f (x) in system (1) in advance so that we can do off-line com- 
putations. However, if such a trajectory is known, it is interesting 
to study the relationship between the usual linear variational equa- 
tion and the observation model at a point on the trajectory. In 
fact, the results are quite surprising. 

Let cp(t) be a solution of x = f(x(t)). Set 

z = x-cp(t) , w = y-h (<p(t) ) , F (t) = .f§(tP(t) ) / and H (t) = ~C<p(t) ) . 

Here ~ is the Jacobian matrix and 4— is the pxn matrix with i— 

9x 9x ^ 


row [ 


3h. 3h. 
x i 


ah. 


9x. ' 9x~ ' ' * ' ' 9x 
12 n 


— ] . Then the linear variational equation along 


4>(t)) is 


(17) 


z = F ( t) z 
w = H ( t ) z , 


For simplicity we state and prove a theorem for a single output 
(p=l) system, but all arguments generalize to the multi-output case. 

Theorem 3.1 . Take the linear variational equation (17) and compute 

I 

the first (n-1) time derivatives at a point x^ = ^(.tQ) • Solving 
the output equation for y and the first (n-1) time derivatives • ' ' 


I 
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• •• • ( 9) •( n — X ) 

equation y,y = y , . . . ,y respectively, we obtain the same set 

of equations as taking the observation model (2) (with A and C de- 
fined by (5) ) and the first (n-1) Lie derivatives of the output. 


Proof . The system (17) has output function 


y-h (q> ( t) ) = w = H ( t) z . 


At tp(t Q ) = x Q this becomes 


y = h(x o> + l$ Cx o’ (x - x o> 


= h (x ) + C (x-x Q ) , 


the use of vector notation being obvious. 
We compute 


3h (<p ( t ) ) _ /. _ 9H(t) „ , 
y - w z + H ( t ) z 


at time tQ(tp(tg) = x^) to find 

y = H (x o )f(x o> + ti Ix o ,£,x o i (x-x o ) + i tx o>M (x o )tJt - x o> 

3x 

= dh (Xq) f (Xg) + dL^h(x Q ) (x-x Q ) 

= Cf (X q ) + CA (x-x Q ) . 


Similarly , 


y - 


3 h(cp(t) ) 
3t 2 


3 H (t) , „ 3H (t) • , TT 

w = 3 ^ — z + 2 — 3 ^ — z + H ( t) z 


for tp ( 1 0 ) = x Q yields 
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Y = 


dLjh(x Q ) f (x Q ) + dL^h(x Q ) (x-x Q ) 
CAf(x Q ) + CA 2 (x-x q ). 


An induction step shows that 


= CA k ^(Xg) + CA k (x-x Q ) 


for 1 _< k _< n - 1 . 

Suppose we take system (2) and consider the Lie derivatives of 
the output y with respect to the vector field f (x^) -Ax^+Ax . We find 

y = h(x Q ) + C (x-Xq) 
y = Cf (Xq) + CA(x-Xq), 
y = CAf (X q ) + CA 2 (x-Xq) 

and 

y (k) = CA k_1 f{x 0 ) + CA k (x-x Q ), 

for 1 _< k £ n -1 . 

Q.E.D. 

The computations of Example 2.2 and Theorem 3.1 verify certain 
i advantages of the observation model over the usual Taylor series 

i 

approach. 1 In fact, if one is interested in state estimation for 
'system (1) using a linear approximation about a point x^, we have 
illustrated the importance of the observation model. 

At the end of section 2 we introduced a set of coordinate changes 
T = (T 1 ,T 2 , . . . ,T ) on !R n so that the system (.1) has a very simple; ■ 
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form. Moreover, in these T coordinates, the Taylor series approach 
and the observation model about a point T g = T(Xq) yield the same 
linear system. We are interested in using the T coordinate changes' 
and system in T coordinates to construct Luenberger observers. 

Assume that we know the inverse transformation T ^ and the func- 
tions , F 2 , . . . , Fp (as functions of ^ , T^) in system (14). 

We start in x coordinates, but since the outputs of systems (1) and 
(14) are the same., we suppose that our plant is in the T coordinates 


Hence we estimate the state T and find estimates x in the original 
coordinates through T ^ . We work near x Q in R n and let T(Xq) = Tq 
as above and x(T Q ) = x Q . 

Since Luenberger observers are constructed for linear systems we 
linearize (14) by a first degree Taylor polynomial about T Q (this is 
the same as the observation model about Tq) to obtain 


T> .= T> 

‘i i+1 

*6. = F i (x( V> 


+ 


i 7 ^ $1 * 3i t 
3F 

gj i (x(T 0 ) ) (T-Tq) , i = 1,2, 


rP, 


(18) 


[T l' T &.j+r 


,T n-e + 1 ] 
n 


Because the outputs in (14) are linear, no linearization of the out- 
put functions is required. 

A 

Set Tq = Tq and define a new system 


A 



/\ 

y 


T i +1 + Vy*-y*> ' 1 ^ 3 i ,e i + B 2 ,.-.,n 
f ± ( x (t q )) +||(x(Tq) ) (T-Tq) + K g (y*-y*) , 


[T X ,T 


*l +v 


■ ,T n -8 + 1 ] ' 
n 


i 


1 , 2 , 


/P/ 


(19) 
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M .. . 




where * denotes transpose and the are lxp matrices Which remain 
to be chosen. 

/\ 

Let T = T - T and compute 

(20) t ± = T i+1 - T ±+1 - K i (y*-y*), i / 3 1 ,e l + P 2 ,...,n 

8F. 

F ~^(x(T 0 ) ) (T-T) - K & (y *-y *) , i - 1,2, . .. ,p. 


By our assumption of linear independence on the set (4) , the linear 
system (18) is observable, and , K 2 / • • • , can be taken to stabilize 

/V t ( A 

system (20) about T = 0. As mentioned previously, estimates x in 

-1 ^ 

the original x coordinates can be computed from T (T) . 

Suppose now that we do not know T ^ or F p , F 2 ' ' ’ ’ ' F p as f unct i° ns 
of , T 2 , . . . , T n . It is interesting that we can derive equations 
(18) , (19) , and (20) and repeat the above process. However, after 

A A 

obtaining the estimate for T, we approximate x through the universe 

of the linear part of T about x Q instead of T 1 itself. Recall the 

definitions of T, ,T_,...,T and F n ,F 0 ,...,F from section 2. Form 

12 n 12 p 

the set D and the matrix E as in the beginning of this section. Use 
the orthogonal coordinate changes on !R n to move E to generalized 
upper Hessenberg form. Since this coordinate change is orthogonal, 
when taking its inverse we need only examine its transpose. For 
this reason we assume that the linear orthogonal coordinate change 
has been done in system (1) . 


Thus we have 
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dT n-e +l (x 0 } ~ dh p (x 0 ) - [0/0/ • • • /0/*j 
n ^ 

dT n - ( 6 ,+B )+I ( V = d Vl ,X o’ = t0 '° 

n-1 n . ^ 

dT l(x 0 ) = dh 1 (x Q ) = [0,0,...,*, ,*]. 


( 21 ) 


dT +2 = dL f h p (x 0 } = [0 '°" /*] 
P n ^ 

: 3 x -i 

dT g = dL f X h 1 (x Q ) = 


where the first * in each right hand expression is nonzero and these 
move one place to the left as we move from one line to the next. 
Solving the equations 

? T n-3 + 1 (X 0 ) 

T n-|3 +1 (X) = T n- 6 tl^O 5 + 3x ( V X nO ) + ,,> 

n n n 


T n- ( g ,.+ B )+l (x) T n- ( 3 • ,+3 )+l (x) + 
n-1 n n-1 n 


,)T n- (6 1 +3 n )+l (X 0 ) 

n-1 n 


3x 


n-1 


3T n-(3 1 +3 n )+l (x 0 ) 

(X n-l" x (n-l)0 ) + 3x~ ( ' X n" X nO ) + * * * 


n 


( 22 ) 


T B 1 = T B 1 (x 0 ) + «ST f , i < x 0 ). ( x-x 0 )> +... 


by ignoring the higher order terms (+...), we find 
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9T n-B +1 (X o’ 

x n -x n 0 = <T n-B + r T n-6 +X (x O )) — 9^ 

n M n n 


X n-1 -X (n-1) 0 (T n-(B n +B )+l T n- ( 6 ,+8 )+l 

n— 1 n n-l n 


(23! 


3T n-(6 +8 )+l ( 0 } 3T n- ( 8 ,+B )+l ( 0 } 

n ~ x n ( X _x )) 1 n 

9x ' n nO ; ' 9x 


n 


n-l 


x 


10 


Our notation here is x Q = (x 1Q ,x 2Q , . . . ,x nQ ) . 

Hence we have computed the inverse of the linear part of the 

B i 

transformation T about x Q . We know the = L f h^ in (14) as func- 
tions of x^ ,x 2 , • • - ' x n * B Y employing equation (23) in the Taylor 
series of each about x^ and dropping terms higher than degree 1, 
we have the linear part of each F^ in the T coordinates. Hence we 
obtain equations (18) , (19) , and (20) . In this way we can build a 
linear system and a Luenberger observer which can possibly be useful 
near x^. Moreover, the computation above can be implemented on a 
computer . 

In [8] , the author considered a control system (3) , linearized 
this system about points as movement along a trajectory occurred, 
and used the various linearizations to determine control actions. 

It is our plan to use the observation model in much the same way, 
but for state estimation. 
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As mentioned in the introduction, suppose we take a system with 
controls and output 


m 

x = f(x(t)) + l u. ( t) g . ( x ( t ) ) 
i=l 1 1 

(24) y = h(x(t) ) 


and desire to build a state estimator to use in controlling the 
system. What conditions allow us to use an observation model at a 
point Xq which is independent of the controls applied? In other 
words, when can we parallel the method used for a linear system? A 
,Ph.D. student of the first author, Mladen Luksic, is making con- 
siderable progress on this problem. He is developing a theory, in- 
cluding application of mathematical topics like the Monge-Ampere 
equations, and is also working on practical simulation problems. 

Suppose we consider the time responses with initial state x Q of 
the nonlinear system (1) and the observation model (2), and for sim- 
plicity we assume both are single output systems. For (1) the out- 
put is 


n+1 


y ( t) = h (x Q ) 


k=l 


L^h (x Q ) 


t 

k! 


+ 


0 n+2. 

0 (/t| ) , 


and for (2) we have 

n+l ,k ~ 

y(t) = h(x Q ) + Z CA X f(x Q ) + 0(|tf ). 

k=l 

But L^h(Xg) = CA^ ^f (X q) , k = 1,2,..., n+l since 

jdL^Xp) = CA k = 0,1,. ..,n. Hence these outputs starting 

n +2 * * 
when x=Xq have difference 0 (it i ). This can easily be generalized 

to multioutput systems. 
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